Variational Inference for Crowdsourcing

نویسندگان

  • Qiang Liu
  • Jian Peng
  • Alexander T. Ihler
چکیده

Crowdsourcing has become a popular paradigm for labeling large datasets. However, it has given rise to the computational task of aggregating the crowdsourced labels provided by a collection of unreliable annotators. We approach this problem by transforming it into a standard inference problem in graphical models, and applying approximate variational methods, including belief propagation (BP) and mean field (MF). We show that our BP algorithm generalizes both majority voting and a recent algorithm by Karger et al. [1], while our MF method is closely related to a commonly used EM algorithm. In both cases, we find that the performance of the algorithms critically depends on the choice of a prior distribution on the workers’ reliability; by choosing the prior properly, both BP and MF (and EM) perform surprisingly well on both simulated and real-world datasets, competitive with state-of-the-art algorithms based on more complicated modeling assumptions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Derivation of the Belief Propagation Algorithm

This document contains derivations and other supplemental information for the NIPS 2012 submission , " Variational Inference for Crowdsourcing ". We derive the belief propagation algorithm (15) in Theorem 3.1.

متن کامل

Fast Inference for Interactive Models of Text

Probabilistic models are a useful means for analyzing large text corpora. Integrating such models with human interaction enables many new use cases. However, adding human interaction to probabilistic models requires inference algorithms which are both fast and accurate. We explore the use of Iterated Conditional Modes as a fast alternative to Gibbs sampling or variational EM. We demonstrate sup...

متن کامل

Early Gains Matter: A Case for Preferring Generative over Discriminative Crowdsourcing Models

Introduction. Here we derive mean field variational updates for MOMRESP. Although this derivation is largely a mechanical exercise, it is our belief that there is a contingent of crowdsourcing practitioners whose background is more practical than theoretical and who may appreciate seeing the mechanics of mean-field variational inference presented in a high level of detail for a model they are f...

متن کامل

Crowd-Selection Query Processing in Crowdsourcing Databases: A Task-Driven Approach

Crowd-selection is essential to crowdsourcing applications, since choosing the right workers with particular expertise to carry out specific crowdsourced tasks is extremely important. The central problem is simple but tricky: given a crowdsourced task, who is the right worker to ask? Currently, most existing work has mainly studied the problem of crowd-selection for simple crowdsourced tasks su...

متن کامل

Nference - R Ules via C Rowdsourcing

The importance of inference rules to semantic applications has long been recognized, and extensive work has been carried out to automatically acquire inference-rule resources. However, despite their potential, the utilization of inference rule resources is currently somewhat limited, in part due to the considerable number of rules which are in fact invalid. A possible solution to this problem i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012